Open DMIX: High Performance Web Services for Distributed Data Mining

نویسندگان

  • Robert Grossman
  • Yunhong Gu
  • Chetan Gupta
  • David Hanley
  • Xinwei Hong
  • Parthasarathy Krishnaswamy
  • Robert L. Grossman
چکیده

In this note, we introduce Open DMIX, an open source collection of web services for the mining, integration, and exploration of remote and distributed data. We also describe some preliminary experimental results using Open DMIX. Open DMIX is layered: the top layer provides templated data mining and statistical algorithms, such as those defined by the Predictive Model Markup Language [8]. The middle layer provides access and integration of remote and distributed data using the DataSpace Transfer Protocol (DSTP) [6]. The bottom layer provides specialized network protocols designed to work with large distributed data sets over wide area networks, which may have high bandwidth delay products (BDPs). Open DMIX clients interact with Open DMIX servers using a version of web services designed for high performance applications, which we call SOAP+. Robert Grossman is also with Open Data Partners.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Experimental Studies Scaling Web Services For Data Mining Using Open DMIX: Preliminary Results

We have developed Open DMIX, which is an open source collection of web services for accessing, exploring, integrating, and mining remote and distributed data. Open DMIX clients interact with Open DMIX servers using a version of web services designed for high performance applications, which we call SOAP+. In this paper we describe experimental studies comparing SOAP and SOAP+ based data mining a...

متن کامل

The Weka4WS framework for distributed data mining in service-oriented Grids

The service oriented architecture (SOA) paradigm can be exploited for the implementation of data and knowledge-based applications in distributed environments. The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid services and applications. WSRF can be exploited for developing high-level services for distributed data mining applications. T...

متن کامل

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

WSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance

The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid applications. WSRF can be exploited for developing high-level services for distributed data mining applications. This paper describes Weka4WS, a framework that extends the widely-used Weka toolkit for supporting distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF tec...

متن کامل

Exploiting need of Service-Oriented Framework for Executing Data Mining Services

Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and remote data mining tasks. Workflow environments are widely used in data mining systems to manage data and execution flows associated t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004